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Abstracts 

Background: Rhesus macaques living in western Sichuan, China, have been separated into several isolated populations 
due to habitat fragmentation. Previous studies based on the neutral or nearly neutral markers (mitochondrial DNA or 
microsatellites) showed high levels of genetic diversity and moderate genetic differentiation in the Sichuan rhesus 
macaques. Variation at the major histocompatibility complex (MHC) loci is widely accepted as being maintained by 
balancing selection, even with a low level of neutral variability in some species. However, in small and isolated or 
bottlenecked populations, balancing selection may be overwhelmed by genetic drift. To estimate microevolutionary 
forces acting on the isolated rhesus macaque populations, we examined genetic variation at Mhc-DQBl loci in 1 19 wild 
rhesus macaques from five geographically isolated populations in western Sichuan, China, and compared the levels of 
MHC variation and differentiation among populations with that previously observed at neutral microsatellite markers. 

Results: 23 Mamu-DQBl alleles were identified in 1 19 rhesus macaques in western Sichuan, China. These macaques 
exhibited relatively high levels of genetic diversity at Mamu-DQBl. The Hanyuan population presented the highest genetic 
variation, whereas the Heishui population was the lowest. Analysis of molecular variance (AMOVA) and pairwise F ST values 
showed moderate genetic differentiation occurring among the five populations at the Mhc-DQBl locus. Non-synonymous 
substitutions occurred at a higher frequency than synonymous substitutions in the peptide binding region. Levels of MHC 
variation within rhesus macaque populations are concordant with microsatellite variation. On the phylogenetic tree for the 
rhesus and crab-eating macaques, extensive allele or allelic lineage sharing is observed betweenthe two species. 

Conclusions: Phylogenetic analyses confirm the apparent trans-species model of evolution of the Mhc-DQBl genes in these 
macaques. Balancing selection plays an important role in sharing allelic lineages between species, but genetic drift may 
share balancing selection dominance to maintain MHC diversity. Great divergence at neutral or adaptive markers showed 
that moderate genetic differentiation had occurred in rhesus macaque populations in western Sichuan, China, due to the 
habitat fragmentation caused by long-term geographic barriers and human activity. The Heishui population should be paid 
more attention for its lowest level of genetic diversity and relatively great divergence from others. 
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evolution 
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Background 

How genetic variation is maintained within populations of 
endangered species is a central issue in evolutionary and 
conservation biology [1]. Diversity as a measure of individ- 
ual variation within a population is widely thought to re- 
flect the number of different genotypes in the population, 
taking into account their frequencies [2]. Neutral markers 
are often a measure of genetic diversity, but neutral gen- 
etic variations do not always correlate with variations at 
adaptively important genes [3] . Therefore, the importance 
of genetic diversity at markers of adaptive significance has 
been increasingly recognized [4,5]. The multi-gene major 
histocompatibility complex (MHC) family is found in ver- 
tebrates, coding for cell surface glycoproteins and is im- 
portant in animal conservation due to its role in resisting 
pathogens [6]. The classical MHC genes of rhesus ma- 
caques can be divided into MHC class I and II genes. The 
MHC class I genes include mainly -A and -B alleles, and 
MHC class II genes include mainly -DM, -DO, -DP, -DQ, 
and -DR alleles. MHC genes are well-known examples of 
genes of adaptive significance and are particularly relevant 
to conservation [7]. Consequently, MHC variability is a 
reflection of the processes that are related to adaptive evo- 
lution within and between populations [8], and the gener- 
ation and maintenance of allelic polymorphisms in the 
MHC genes is a major topic in evolutionary genetics. The 
high levels of polymorphisms usually observed at MHC 
genes are most likely to be maintained by balancing selec- 
tion, and driven largely by exposure to a diversity of path- 
ogens [9-11], but some studies on MHC variation in 
natural populations show that balancing selection does 
not always maintain high levels of MHC variation relative 
to neutral markers [12,13]. In particular in small popula- 
tions, the strength of balancing selection on MHC genes 
may be weak relative to other microevolutionary forces 
such as genetic drift [6,14]. Genetic drift is the random 
fluctuation of allele frequencies over time, thus, adaptive 
alleles may be lost, and deleterious alleles could be fixed in 
the population. Many natural populations are threatened 
by intense reduction and fragmentation of habitat, which 
leads to population isolation, decline in the number and 
the loss of genetic diversity [15]. The small population size 
and fixation of deleterious alleles leads to inbreeding depres- 
sion and reduction of individual fitness, which decreases via- 
bility and compromises a populations evolutionary adaptive 
potential [16]. The loss of genetic diversity may increase the 
risk of extinction due to decreased reproductive fitness, 
decreased adaptive flexibility, and increased disease sus- 
ceptibility [17]. An important assumption in conservation 
genetics is that small and isolated populations are more 
sensitive to genetic drift and inbreeding [18,19]. Clarifying 
the mechanism that determines genetic variation in small 
and isolated populations is therefore essential for species 
conservation [19]. 



Western Sichuan, located along the southeastern edge of 
the Qinghai-Tibetan Plateau (QTP) and the northern section 
of Hengduan Mountains, is the global biodiversity hotspots. 
Affected by the Pliocene uplifting of the QTP, western 
Sichuan formed a complex and diverse topography with high 
mountains and deep valleys. The region, from southwest to 
northeast, has four major mountains, namely, Liangshan, 
Xiangling, Qionglai, and Mingshan and four large rivers, 
namely, Jinshajiang, Yalongjiang, Daduhe, Mingjiang and 
Jialingjiang (Figure 1). Additionally, there are many human 
settlements and roads in the region. Thus, many wild ani- 
mals and plants - endangered species in particular - have 
actually formed fragmented distributions in western Sichuan 
due to the barrier effect of these natural and man-made 
impediments. The rhesus macaque {Macaca mulatto) is 
mainly found throughout most of southern Asia crossing 
eastern Afghanistan, Bangladesh, Bhutan, northern and cen- 
tral India, central and southern China, Lao PDR, Myanmar, 
Nepal, northern Pakistan, northern Thailand, and Vietnam, 
and is a endangered species due to the sharp reduction in 
the number throughout its wider distribution in recent 
years. This species is listed in CITES Appendix II, Schedule 
III of the Bangladesh Wildlife (Preservation) Act (1974), 
Schedule I of Part I of the Indian Wildlife (Protection) Act 
(amended up to 2002), and Category II of the Chinese 
Wildlife Protection Act (1989). According to Jiang [20], 
Chinese rhesus macaques were divided into six subspecies. 
Rhesus macaques in western Sichuan are classified as a sep- 
arate subspecies and designated as M. mulatta lasiotis, 
which was isolated from the other five subspecies by human 
activities and natural barriers (Yangtze River) and has been 
fragmentized in western Sichuan mainly due to the special 
topography in this region (Figure 1). Therefore, western 
Sichuan is a suitable place to explore how microevolution- 
ary forces such as balancing selection and genetic drift 
maintain the variation pattern of rhesus macaque popula- 
tions in western China. Understanding evolutionary pro- 
cesses that maintain genetic diversity in natural populations is 
an essential goal of population genetics [21,22]. Neutral or 
nearly neutral markers such as mitochondrial or microsatel- 
lite DNA are informative for phylogenetic and phylogeo- 
graphic reconstructions [23]. Genetic diversity analysis using 
mitochondrial DNA markers showed that significant genetic 
differentiation had occurred among different isolated pop- 
ulations of Sichuan rhesus macaques [24]. Subsequent 
microsatellite loci analysis also demonstrated that the genetic 
differentiation has occurred between a local (Heishui) popula- 
tion and other populations [25]. Another study, using a large 
dataset of maternally inherited mitochondrial DNA gene se- 
quences and nuclear microsatellite DNA data, revealed two 
maternal super haplogroups that exist in Chinese rhesus ma- 
caques, one in eastern China and another in western China, 
and further analysis showed that orogenesis likely drove the 
divergence of western populations in China [26]. 
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The rhesus macaque, cynomolgus macaque, common 
marmoset and owl monkey are non-human primates that 
are often used as experimental model animals for biomed- 
ical research, such as transplantation studies and vaccine 
development against measles, Ebola, or other infectious 
diseases including acquired immune deficiency syndrome 
(AIDS) [27,28]. For use in immune- related studies, it is ne- 
cessary that the MHC background of these animals be 
characterized. Until now, various MHC genes including 
class I and II have been identified and characterized in dif- 
ferent degrees for these non-human primate animals. For 
example, 14 Aona-DQBl alleles were identified that sepa- 
rated the two lineages Aona~DQBl*22 and Aona-DQBl*23 
in a group of 19 unrelated owl monkeys (Aotus nancymaae) 
[29]; Otting et al. [30] sequenced exon 2 of the Mhc-DQB 
gene in each of a random panel of 60 non-pedigreed cyno- 
molgous macaques (Macaca fascicular is), resulting in the 
detection of 23 Mafa-DQBl alleles that had not previously 
been published and confirmed the trans-species model of 
evolution of the Mhc-DQB lineages, in which a group of 
major allele lineages is shared by different species of non- 
human primates in the phylogeny. Qiu et al. [31] identified 
37 MhcMamu-DQBl alleles in 105 Chinese rhesus ma- 
caques by cloning and sequencing, illustrating a marked al- 
lelic polymorphism at DQB1 in these monkeys. Notably, in 



recent years more MHC class II DQB alleles have been 
identified and characterized in two closely related monkeys, 
rhesus and cynomolgous macaques, due to their common 
use as experimental animal models in biomedical research. 

The above work provided us with the inspiration and base 
to explore the genetic variation patterns of rhesus macaques 
in western China based on MHC genes with adaptive signifi- 
cance because of their role in pathogen resistance. The aims 
of this study included the following: 1) to investigate poly- 
morphism levels of MHC class II DQB1 locus in five isolated 
populations of rhesus macaques in western Sichuan, China; 
2) to estimate the degree of genetic differentiation among or 
within populations; 3) to determine the relative roles of 
microevolutionary forces such as balancing selection or gen- 
etic drift in populations; and 4) to confirm the trans-species 
model of evolution of Mhc-DQBl lineages between rhesus 
and cynomolgous macaques. 

Results 

Frequency distribution of Mamu-DQBI alleles 

A total of 119 rhesus macaque individuals in western 
Sichuan were typed into 49 genotypes through the polymer- 
ase chain reaction (PCR) and single-strand conformation 
polymorphism (SSCP) analysis based on the polymorphism 
of the second exon of the Mamu-DQBI gene, and then 
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cloning and sequencing of the second exon of the Mamu- 
DQBl gene were conducted for the samples representing 
the 49 genotypes. Consequently, 23 Mamu-DQBl alleles 
were identified in Sichuan rhesus macaques (Table 1). Each 
Mamu-DQBl sequence obtained was 274 bp (excluding 
primers) in length and spanned the complete sequence of 
exon 2. No more than two alleles were detected in each in- 
dividual, and no stop codon or insertion/deletion was ob- 
served in Mamu-DQBl. By blast searches in the IPD-MHC 
Database (http://www.ebi.ac.uk/ipd/mhc/), the 23 allelic se- 
quences were found to be completely identical to some des- 
ignated alleles, showing that these alleles also existed in 
other rhesus macaque individuals in previous study. 

The number and frequency of the Mamu-DQBl alleles 
varied substantially in five different geographical popula- 
tions (Table 1 and Figure 2). The highest frequency was 
Mamu-DQBV614 among these rhesus macaques, which 
was found in 31 (13.03%) of the 119 monkeys; the next 
most common frequency was Mamu-DQB*1503, which 
was detected in 30 (12.61%) of these monkeys. The third 



frequency was Mamu-DQB*1703, which was detected in 29 
(24.37%) of these monkeys. Three of the five geographical 
populations had their unique alleles. Mamu-DQB*1502, 
Mamu-DQB*1601, Mamu-DQB* 170601, Mamu-DQB*1804, 
and Mamu-DQB*1819 were found only in the Hanyuan 
population, Mamu-DQB*061301 and Mamu-DQB*1818 were 
found only in the Danba population, and Mamu-DQW0607 
was found only in the Jiulong population. Mamu-DQW0614 
was found in all populations at different frequency from 
1.79% to 21.43%. Mamu-DQB*061301 was detected only in 
one individual. 

Genetic diversity of the rhesus macaque populations 

In order to estimate genetic diversity of the rhesus ma- 
caques in western Sichuan based on the adaptive marker, 
several genetic diversity parameters such as haplotype di- 
versity (h), nucleotide diversity (ti), expected heterozygosity 
(He) and allelic richness (A R ) were calculated (Table 2). The 
values of h, He, and A R were 0.718 ~ 0.891, 0.780 ~ 0.875, 
and 5.733 ~ 11.938, respectively, indicating a relatively high 



Table 1 Distribution of allelic frequencies for Mamu-DQBl in five rhesus macaque populations 



Alleles 


Total (n 


= 119) 


JL(n 


= 20) 


HY (n 


= 28) 


DB (n 


= 14) 


XJ (n 


= 27) 


HS (n 


= 30) 




NA 


AF 


NA 


AF 


NA 


AF 


NA 


AF 


NA 


AF 


NA 


AF 


1. Mamu-DQB1*0605 


19 


7.98 


17 


42.50 


2 


3.57 














2. Mamu-DQB1*0606 


25 


10.50 


2 


5.00 






7 


25.00 


13 


24.07 


3 


5.00 


3. Mamu-DQBl *0607 


3 


1.26 


3 


7.50 


















4. Mamu-DQB1*0610 


3 


1.26 


1 


2.50 


2 


3.57 














5. Mamu-DQBl *061 101 


13 


5.46 


1 


2.50 






5 


17.86 


1 


1.85 


6 


10.00 


6. Mamu-DQB1*061102 


2 


0.84 






1 


1.79 






1 


1.85 






7. Mamu-DQB1*061301 


1 


0.42 










1 


3.57 










8. Mamu-DQB1*0614 


31 


13.03 


1 


2.50 


1 


1.79 


6 


21.43 


4 


7.41 


19 


31.67 


9. Mamu-DQB1*0617 


8 


3.36 














3 


5.56 


5 


8.33 


10. Mamu-DQBl *1 501 


6 


2.52 


4 


10.00 


1 


1.79 






1 


1.85 






1 1. Mamu-DQB1*1502 


4 


1.68 






4 


7.14 














12. Mamu-DQBl *1 503 


30 


12.61 






7 


12.50 


3 


10.71 


19 


35.19 


1 


1.67 


13. Mamu-DQBl *1 601 


2 


0.84 






2 


3.57 














14. Mamu-DQBl *1 703 


29 


12.18 














4 


7.41 


25 


41.67 


15. Mamu-DQBl *1 70601 


3 


1.26 






3 


5.36 














16. Mamu-DQB1*1709 


5 


2.10 


4 


10.00 














1 


1.67 


17. Mamu-DQBl *1 801 


8 


3.36 


3 


7.50 


3 


5.36 


2 


7.14 










18. Mamu-DQB1*1804 


5 


2.10 






5 


8.93 














19. Mamu-DQBl *1 810 


7 


2.94 


1 


2.50 


2 


3.57 






4 


7.41 






20. Mamu-DQBl *1 811 


23 


9.66 


2 


5.00 


15 


26.79 


2 


7.14 


4 


7.41 






21. Mamu-DQB1*1812 


3 


1.26 


1 


2.50 


2 


3.57 














22. Mamu-DQB1*1818 


2 


0.84 










2 


7.14 










23. Mamu-DQB1*1819 


6 


2.52 






6 


10.71 















Notes: NA, number of alleles; AF, frequency of alleles (Gene frequency shown in the table was the percentage); n, number of individual. Shot line (hyphen) means 
zero. The bold allelic name showed 100% identity between Mamu-DQBl and Mafa-DQB1 nucleotide sequences. JL, Jiulong; HY, Hanyuan; DB, Danba; XJ, Xiaojin; HS, 
Heishui. Asterisk* is only a separate symbol between Mamu-DQB1 and allele number in the allele name. 
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Figure 2 Histogram of MhcMamu-DQBl frequency distribution in populations. The y-axis represents the alliic frequency and x-axis represents 
23 MhcMmamu-DQBl alleles, No. 1-23 presents each allele name in Table 1; JL, Jiulong; HY, Hanyuan; DB, Danba; XJ, Xiaojin; HS, Heishui. Red asterisk (*) 
represents group-specific alleles in this population. 



level of genetic diversity at Mamu-DQBl locus. The ana- 
lysis of nucleotide diversity revealed that most of the nu- 
cleotide variation was located in the antigen-binding site 
(ABS) region, and there were at least 20 nucleotide varia- 
tions on average in each population. The Hanyuan popula- 
tion exhibited the highest genetic diversity: 15 allele 
haplotypes were observed in 28 animal individuals. The 
Heishui population exhibited the lowest genetic diversity: 
only 7 allele haplotypes were observed in 30 animal individ- 
uals. Surprisingly, the inbreeding coefficient (F is ) in the 
Hanyuan population was also the highest, whereas it was 
lowest in Heishui population. 



Phylogenetic analysis 

The phylogenetic tree was constructed for 156 Mhc-DQBl 
gene sequences, including 69 Mamu-DQBl, 85 Mafa- 
DQB1 and 2 Aona-DQBl alleles by using the neighbor- 
joining method (Figure 3), and aona-DQBl alleles were used 
as the out-group. The 156 sequences can be searched in the 
IPD-MHC Database (http://www.ebi.ac.uk/ipd/mhc/). As is 
evident from the tree, the majority of Mhc-DQBl alleles 
tended to cluster together according to lineage group rather 
than species or sample origins, revealing a trans-species 
model of evolution. Twenty-three Mamu-DQBl alleles were 
classed into five lineages: DQB1V6, *75, *i6, 9 17, and *18, of 
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Table 2 Genetic diversity of Mamu-DQBl within rhesus macaque populations 



Population 


L 


h 


77 


K 


Ho 


He 


Ar 


Fis 


M-He 


M-Rs 


JL 


12 


0.800 


0.07688 


20.7564 


0.600 


0.780 


10.275 


0.255 


0.73 


6.27 


HY 


15 


0.891 


0.09296 


25.1000 


0.607 


0.875 


1 1 .938 


0.323 


0.72 


6.40 


DB 


8 


0.862 


0.08600 


23.2196 


0.786 


0.832 


8.000 


0.092 


0.68 


5.68 


XJ 


10 


0.807 


0.10219 


27.5919 


0.741 


0.792 


8.262 


0.084 


0.70 


6.20 


HS 


7 


0.718 


0.08387 


22.6441 


0.833 


0.706 


5.733 


-0.164 


0.67 


5.27 



Notes: L, number of alleles; h, haplotype diversity; n, nucleotide diversity; K: Average number of nucleotide differences; Ho, observed heterozygosities; He, 
expected heterozygosities; Fis, the inbreeding coefficient; A R , estimates of allelic richness. M-He is the mean heterozygosity of Microsatellite alleles and M-Rs is the 
mean allelic number across 30 microsatellite loci per population from Li et al. [25]. JL, Jiulong; HY, Hanyuan; DB, Danba; XJ, Xiaojin; HS, Heishui. 



which lineage DQB1V6 and *78 were most dominant (9/ 
23 = 39% and 7/23 = 30%, respectively). Almost all previ- 
ously reported major Mamu-DQBl lineages were detected 
(DQB1V6, '"IS, *17, *18) 9 except for DQBF24 lineages that 
were not found in the western Sichuan macaques examined 
in this study. It is worth noting that rhesus macaques 
shared only 14 identical alleles with crab-eating macaques 



on the phylogenetic tree, but there were 10 alleles to be de- 
tected in this study (see Table 1). 

Selective pressure analysis 

To analyze the selective pressure on Mamu-DQBl exon 2 
sequences in western Sichuan rhesus populations, the aver- 
age non-synonymous and synonymous substitution rates 



CO 



535 



Mafa-DQBr 170602 
Mafa-DQB1 *1 70601 
Mamu-DQB1*1713Q 
Mamu-DQB1M70602Q 




Mafa-DQB1*0627 
O Mamu-DQB1*0618 
O Mamu-DQB 1*0620 
• Mamu-DQB 1*0605 

Wafe-DQBroeae 



Figure 3 Phylogenetic tree of the MhcMamu- and MhcMafa-DQBI alleles using neighbour-joing methods. 23 Mamu-DQB1 sequences 
detected in this study are shown with a solid round spot. 46 Momu-DQBl alleles not detected in animals of this study are showed with a circle. 
85 Mofo-DQBl sequences are showed without anything. 2 Aono-DQBl (Aona-DQB1*220101 and Aona-DQB1*220102) were selected as outgroup. 
Except for 23 Mamu-DQB1 sequences detected in this study, all other DQB1 sequences were retrieved from the IPD-MHC Database 
(http://www.ebi.ac.uk/ipd/mhc/). Numbers at the node are bootstrap value in statistic analysis, only values above 50% are showed. 
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(d N /ds, co) for all sites, antigen-binding sites (ABS), and 
non-antigen binding sites (non-ABS) were calculated 
(Table 3, Dataset A: 23 sequences). The highest co ratio 
value was found in ABS and was greater than one (co = 
1.458), whereas the lowest co ratio value was found in non- 
ABS and less than one (co = 0.976), but these values did not 
significantly deviate from the neutral expectation in the 
Z-test (p > 0.05). To better understand the substitution pat- 
tern of Mamu-DQBl exon 2 sequences, 46 Mamu-DQBl 
exon 2 sequences from the previous studies were added to 
our sequence data to expand our analysis (Table 3, Dataset 
B: 65 sequences). The same analysis with Dataset A was 
performed. We found that all co ratio values for dataset B 
were greater than that of Dataset A and greater than one, 
especially the co ratio value in the ABS of Dataset B was 
found to be significantly greater than one (co = 2.041, p < 
0.05). These results indicated that the ABS of Mamu-DQBl 
exon 2 sequences had been subjected to a strong balancing 
selection, but the selection signal was not significant in the 
Mamu-DQBl exon 2 sequences of western Sichuan rhesus 
macaques. 

Genetic differentiation among five rhesus-macaque 
populations 

From the analysis of molecular variance (AMOVA, 
Table 4) for the five populations, 85.29% of the total vari- 
ation in our dataset was observed within populations and 
14.71% was observed among populations, showing that 
the genetic variation primarily came from within popula- 
tion. Although the genetic differentiation within a popula- 
tion was significantly higher than among populations, 
there was still a moderate differentiation among popula- 
tions because of a larger fixation index (F sx ) (0.14712, p < 
0.01). To further assess the genetic differentiation among 
populations, the pairwise comparison F S t and gene flow 
(Nm) values among populations were calculated (Table 5). 
The degree of genetic differentiation was smallest between 

Table 3 Mean numbers of nucleotide substitutions per 
non-synonymous site (c/ N ) and synonymous site (c/ s ) and 
the d N /d s ratios (w) 

Mamu-DQBl Sites n cf N d s w P 

Dataset A ABS 16 0.312±0.073 0.214±0.089 1.458 0.163 

Non-ABS 73 0.082 ±0.01 8 0.084 ±0.021 0.976 0.483 

All 89 0.1 18 ±0.020 0.1 03 ±0.020 1.146 0.279 

Dataset B ABS 16 0.251 ±0.081 0.123±0.068 2.041 0.039* 

Non-ABS 73 0.082 ±0.01 8 0.078 ±0.01 7 1.051 0.427 

All 89 0.1 06 ±0.020 0.085 ±0.01 7 1.247 0.192 

Notes: Dataset A represented 23 Mamu-DQBl alleles in this study;Dataset B 
represented currently 69 Mamu-DQBl alleles including 46 alleles from IPD. 
ABS, the putative amino acid residues involved in antigen binding region (site 
9, 1 1, 13, 28, 47, 57, 61, 67, 70, 71, 74, 78, 85, 86, 89 and 90), were identified 
according to the corresponding antigen binding sites identified in human 
HLA-DR and DQ structure (72). n, the number of codons encoding amino acid; 
Statistical significance is indicated by the asterisks (P <0.05). 



the Danba and Xiaojin populations, whereas the Heishui 
and Jiulong populations had the largest differentiation. 
The Heishui population had relatively little gene exchange 
with others. 

To further explore genetic differentiation, we compared 
He and A R between microsatellite and Mamu-DQBl 
markers (Figure 4). The richness of MHC sequences per 
population provides a measure of allelic diversity within 
each population; the A R values of both the microsatellite 
sites and Mamu-DQBl are significantly positively corre- 
lated (r 2 = 0.961, df=4,p = 0.003). However, there is not a 
significant correlation between microsatellite sites and 
Mamu-DQBl heterozygosity per population (r 2 = 0.2293, 
df= 4, p = 0.4145). No significant correlation to pairwise 
F ST between microsatellite sites and Mamu-DQBl (r 2 = 
0.2198, df= 9,p = 0.1716). The F ST values of Mamu-DQBl 
showed higher divergence among populations when 
compared with microsatellite loci (except for the 
Danba-Hanyuan and Danba-Xiaojin populations), and all 
of the F ST values (except for that of the Danba-Xiaojin 
population) at the Mamu-DQBl locus were higher than 
0.05, indicating that divergence existed among popula- 
tions, especially between the Heishui and Jiulong popula- 
tions, which had the largest F ST . Overally, the population 
difference was more obvious at the Mamu-DQBl locus 
than the microsatelites: six F S t values from the Mamu- 
DQBl locus were far larger than that of the microsatellites 
in ten pariwise population comparisons (Table 5). 

Discussion 

Genetic diversity of Mamu-MhcDQBl locus 

In various studies, the maintenance of a high level of gen- 
etic diversity is assumed to be a preliminary strategy for 
the population conservation [32-34]. More specifically, 
high genetic diversity has often been shown to be posi- 
tively correlated with indicator values of individual fitness 
such as the mating success of animals and adaptability to 
changing environmental conditions [35,36]. In accordance 
with a previous study in rhesus monkeys, Chinese-origin 
rhesus macaques have a greater genetic variation at the 
Mamu-DQBl locus and higher nucleotide diversity of 
mtDNA when compared with Indian-origin rhesus ma- 
caques [31,37]. For example, only 16 Mamu-DQBl alleles 
were identified from 107 Indian origin individuals born in 
captivity [38], whereas 23 Mamu-DQBl alleles were iden- 
tified from 119 Chinese rhesus macaques in this study. 

The high level of polymorphism usually observed in 
MHC genes is likely to be maintained by balancing selec- 
tion, driven largely by exposure to a diversity of pathogens 
[9-11]. In the present study, the Hanyuan population had 
conspicuously higher A R , haplotype diversity (h) and He 
values than any other populations (Table 2), exhibiting 
higher genetic diversity. This implied a strong balancing se- 
lection force that may have occurred in this population. In 
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Table 4 Analysis of molecular variance (AMOVA) of Mamu-DQBI sequences in five rhesus macaque populations 



Source of variation 


d.f 


Sum of squares 


Variance components 


Percentage of variation 


Fixation index {Fst) 


Among populations 


4 


14.706 


0.069821/a 


14.71 


0.14712* 


Within populations 


233 


94.315 


0.404791/6 


85.29 




Total 


237 


109.021 









Notes: Va, variance components among populations; Vb, variance components within populations; ^Significant value (P<0.05). 



addition, the Hanyuan population lives in the Xiangling 
Mountains which is a transition zone between the north 
and south of two distinct climates, the wet and cold 
Qionglai mountains in the north and the dry and hot 
Liangshan mountains in the south. Thus, the climate 
transition zone might also be a genetic intersection area 
of species, leading to higher genetic diversity than other 
nearby areas. However, the Heishui population exhibited a 
lower level of genetic diversity than other populations. 
In particular, Mamu-DQB 1*0614 and Mamu-DQB 1*1703 
are significantly more frequent in this population (Table 1). 
It is implied that a founder effect or genetic drift may have 
occurred in Heishui population. In addition, previous ana- 
lyses of MHC variation in species that span large geo- 
graphical ranges have emphasized the role of diversifying 
selection on MHC genes [39-42], thus we can not rule out 
the possibility of strong selection for the two alleles 
(Mamu-DQB 1*0614 and Mamu-DQB 1*1703) that are 
present at considerably higher frequencies in this popula- 
tion. Nevertheless, it is possible that there are additional 
MHC alleles that were not detected in Heishui population 
because of its smaller sample size. 

Disassortative mate choice may also contribute to the 
maintenance of MHC diversity [43,44]. Choosing part- 
ners with maximal or optimal genetic dissimilarity may 
help to avoid inbreeding or excessive outbreeding, 



Table 5 Genetic differentiation among five rhesus 
macaque populations 



Population 


JL 


HY 


DB 


XJ 


HS 


JL 




3.065 


2.473 


2.046 


1.533 


HY 


0.075** 




4.006 


4.255 


1.960 




0.0431 










DB 


0.092** 


0.059** 




6.175 


2.932 




0.0539 


0.0599 








XJ 


0.109** 


0.055** 


0.039* 




2.176 




0.0494 


0.0573 


0.0539 






HS 


0.140** 


0.113** 


0.079** 


0.103** 






0.0858 


0.0924 


0.0976 


0.0757 





Notes: Pairwise fixation index (F ST ) values calculated using allele frequency 
only (lower diagonal), DQB1-F sr values were in the first row; and microsate- F ST 
values were in the second row and were italic (from Li et al., [25]). Gene flow 
(A/m) values were in upper diagonal. Statistical significance is indicated by the 
asterisks (*P< 0.05, **P< 0.01). JL, Jiulong; HY, Hanyuan; DB, Danba; XJ, Xiaojin; 
HS, Heishui. 



respectively. To increase offspring diversity at key func- 
tional loci in a natural population, a recent study indicated 
that inbreeding avoidance was along with disassortative 
mate choice at DRB, but not at DQB locus [45]. Our re- 
sults also seem to support this point. For example, the 
Hanyuan and Jiulong populations had high levels of gen- 
etic diversity, but they also had the highest inbreeding co- 
efficients (Fis) at the Mamu-DQBI locus (Table 2). 

Balancing selection and trans-species polymorphism 

Balancing selection is expected to preserve high levels of 
polymorphisms at MHC loci by retaining alleles during 
species diversification events [46,47]. The d N ld s test for 
high rates of non-synonymous (d N ) compared to syn- 
onymous (d s ) substitutions (co = d N ld s ) is the most com- 
mon measure used to detect balancing selection acting on 
protein-coding genes. Polymorphisms within the ABS of 
MHC genes are thought to be maintained by balancing se- 
lection imposed by pathogens/parasites. Fast-evolving par- 
asites may adapt to the most common host genotype and 
escape presentation of their antigens to the adaptive im- 
mune system of the host [48]. In this study, the analysis of 
the nucleotide acid substitution pattern showed that 
Mamu-DQBI exon 2 sequences had been subjected to a 
strong balancing selection, but the selection signal was not 
significant in the Mamu-DQBI exon 2 sequences of west- 
ern Sichuan rhesus macaques. This may well be caused by 
genetic drift acting differently on the different populations, 
e.g., the Heishui population may has been subject to a gen- 
etic drift, leading to a low level of genetic diversity. Alterna- 
tively, this may also be due to the smaller sample size 
surveyed here. 

Under balancing selection, some allelic lineages exhibit 
unusual longevity that predates speciation events, leading 
to "trans-species polymorphisms" (TSP) used to describe 
polymorphisms among species [49] . In an extreme case of 
TSP, different species share identical alleles. Identical 
MHC gene sequences can be found in different species 
that have been separated by millions of years of independ- 
ent evolution [50]. This is remarkable, given that these 
genes are involved in a co-evolutionary arms race with 
parasites [51]. In the present study, the phylogenetic ana- 
lysis has illustrated that Mhc-DQBl allele sequences from 
rhesus and cynomolgus macaques tended to cluster to- 
gether according to allelic lineages rather than species or 
sample origins, leading to the sharing of allelic lineages by 
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Microstal allelic richness 



0.67 0.69 0.71 0.73 

Microstal heterozygosity 



Figure 4 Correlations between MhcMamu-DQBI allele and microsatellite locus variation. (A) Mean microsatellite heterozygosity vs. MHC 
heterozygosity. (B) Mean microsatellite allelic richness vs MHC allelic richness per population. 



different species. The result clearly confirmed the trans- 
species model of evolution of the Mhc-DQBl lineages. In 
particular, 14 Mamu-DQBl alleles matched exactly with 
the Mafa-DQBl alleles, suggesting that they may have oc- 
curred because of trans-species conservation after the di- 
vergence of the rhesus and cynomolgus macaque 1.8-2.0 
million years ago (mya) [52,53,28]. Alternatively, the iden- 
tical nucleotide sequences in the rhesus and cynomolgus 
macaques may have arisen by occasional interbreeding be- 
tween the two species. In fact, a recent study on genome 
sequencing and comparison of the cynomolgus and 
Chinese rhesus macaques showed that the cynomolgus 
macaque genome has been shaped by introgression after 
hybridization with the Chinese rhesus macaque [54]. The 
allele sharing between the rhesus and cynomolgus ma- 
caques has also been found in other MHC class II genes 
such as DQA1, DPB1, and DRB1 [38]. However, it is un- 
usual that the sharing phenomenon has not been by now 
detected in any MHC class I genes, which is a synthetic 
group in the same chromosome with class II genes. 

Genetic differentiation among populations 

AMOVA of the five populations indicated that variations 
mainly occurred among populations and that the Fst was 
moderate (0.14712) (Table 4). Fst represents the level of 
genetic differentiation among populations [55,56]: an Fst 
of 0 to 0.05 represents "little differentiation," 0.05 to 0.25 
"moderate differentiation," and values greater than 0.25 
"very great differentiation" [57] . Therefore, our results re- 
vealed that moderate genetic differentiation had occurred 
among the five rhesus macaque populations. To further 
assess genetic differentiation among the five populations, 
we also estimated Fst and Nm. Nm (also known as gene 
migration) is the transfer of genes from one population to 
another. Generally, if Nm < 1, genetic drift will result in 
substantial local differentiation; if Nm > 1, gene flow 



between populations is higher and the extent of genetic 
differentiation is smaller. When Nm > 4, gene exchange is 
more frequent and genetic differentiation is much smaller 
[58,59]. In this study, all values of Nm were larger than 1 
(1.533-6.175). Especially, the values of Nm among three 
populations (Danba, Xiaojin, and Hanyuan) were even lar- 
ger than 4, suggesting that gene exchange was more fre- 
quent among the three populations. 

One hallmark of animal mtDNA is strict maternal inher- 
itance. Previous study about mtDNA indicated poor gene 
exchange occurred in these five populations with the Nm 
values less than 1, thus gene exchange may occur mainly 
in nuclear genes. In other words, genetic exchanges mainly 
come from the migration of paternal macaques, which is 
consistent with male-biased migration behavior in various 
macaques. However, the levels of differentiation among 
the five populations were moderate. The differentiation 
were highest between the Jiulong and Heishui populations, 
and lowest between the Danba and Xiaojin populations. 
We speculated that genetic differentiation between these 
populations has arisen due to the geographical barrier of 
the high mountains, large rivers and long geographic dis- 
tances between populations. For example, Jiulong and 
Danba populations are located between the Yalongjiang 
and Dadu Rivers, but they are separated by the Gongga 
Mountains. It is difficult for rhesus macaques to cross 
these geographical barriers because of their vast width, 
depth, length, and flow. In fact, it has been shown that 
large rivers formed a natural barrier that obstructs gene 
exchange between many mammal populations [60,61]. 

Differences in parasite abundance and diversity in envi- 
ronments may result in selection for different sets of alleles 
in different populations. Loci under balancing selection are 
predicted to own a higher variation within population, but 
lower differentiation between populations when compared 
with neutral or nearly neutral markers, assuming that the 



Yao et al. BMC Evolutionary Biology 2014, 14:130 
http://www.biomedcentral.com/1471 -21 48/1 4/1 30 



Page 10 of 13 



selective pressures in the two populations are similar [62]. 
However, genes under directional selection present in- 
creased divergence between populations relative to neutral 
or nearly neutral markers if selective pressures differ between 
populations, or they may present decreased divergence rela- 
tive to neutral or nearly neutral markers if selection pres- 
sures are similar [63]. Thus, compared with neutral forces, 
balancing selection is supposed to diminish population dif- 
ferentiation as measured by conventional pairwise F ST . 
Hence, the population structure under balancing selection 
should not be distinct. In the present study, nine out of ten 
of the pairwise F ST values of Mamu-DQBl were greater than 
0.05 and eight pairwise F ST values were even greater than 
that of the microsatellites (Table 2). A significant positive 
correlation was found between allelic richness in Mamu- 
DQBl and microsatellites; a similar positive correlation was 
also found on pairwise F S t values, implying that genetic drift 
had played a significant role in maintaining MHC diversity 
for rhesus macaques. However, as mentioned above, the 
sharing of allelic lineages between species observed on our 
phylogenetic tree indicated a "trans-species polymorphism" 
under balancing selection, but the rhesus and cynomolgus 
macaques have diverged from each only 1.8-2.0 mya [53]. 
Therefore, the balancing selection acting on Mamu-DQBl 
may be historical. 

Conclusion 

It is widely accepted that variation at MHC loci is main- 
tained by balancing selection, even with a low level of neu- 
tral variability in some species [64,65], indicating the 
importance of balancing selection for maintaining genetic 
variation in the wild and also exposing the limitations of 
the neutral marker as substitutes for variation in fitness- 
related genes [66]. However, balancing selection may be 
overwhelmed by other microevolutionary forces such as 
genetic drift in small and isolated or bottlenecked popula- 
tions [67] . In this study, we detected evidence of a weaker 
balancing selection and found similar genetic variation 
pattern both at neutral and adaptive markers, suggesting 
that genetic drift was stronger than selection, thus leading 
to a reduction of MHC diversity in the local populations. 
AMOVA results also revealed that a moderate genetic dif- 
ferentiation had occurred in several segregated populations 
in western Sichuan, China, due to habitat fragmentation 
caused by long-term geographic barriers and human activ- 
ities. The Heishui population should be paid more attention 
for its lowest level of genetic diversity and relatively high di- 
vergence from others. This study provides deep insight into 
genetic variation of rhesus macaque populations in the 
fragmented habitat and helps to make an effective strategy 
for species conservation and facilitate the use of rhesus ma- 
caque in biomedical research. However, we have to ac- 
knowledge that the findings in this study are still very 
limited because of the smaller sample size as well as a single 



MHC gene locus. Therefore, to clarify thoroughly the gen- 
etic variation pattern of rhesus macaque populations in 
western China, further research is still needed using other 
MHC genes, other immune system loci and other markers, 
as well as more comprehensive sampling. 

Methods 

Samples and Genomic DNA extraction 

The whole blood samples were collected from 119 western 
Sichuan rhesus macaques at the National Experimental 
Rhesus Research Centre of Sichuan Agricultural University. 
All of these animals were the first-captured founder popula- 
tion from five different geographical populations in the 
wild, Jiulong (n = 20), Hanyuan (n = 28), Danba (n = 14), 
Xiaojin (n = 27), and Heishui (n = 30), located in western 
Sichuan, China (Figure 1). For each population, the samples 
are selected randomly to try to ensure that no correlation 
existed between animal individuals, i.e., they were not from 
the same family. All samples were collected in strict com- 
pliance with the Chinese Wildlife Conservation Act. Also, 
the Sichuan Agricultural University Ethics Committee pro- 
vided ethical approval for this study. Genomic DNA was 
extracted from samples according to the standard phenol- 
chloroform method [68]. Products were examined with 
1.0% agarose gel electrophoresis and were visualized under 
ultraviolet light. 

PCR and SSCP typing 

A pair of specific primers (Mamu-DQBIF 5'-GCC 
TGACTGACTGGACGGTGATTC-3' and Mamu-DQBIR 
5'-GGGGCGACAACGCTCACCTC-3') were used to amp- 
lify an approximately 317-bp DNA fragment containing the 
complete second exon of the Mamu-DQBl gene. For poly- 
merase chain reaction (PCR) amplification of Mamu- 
DQBl, all PCRs were performed in a reaction volume of 
10 ul, containing 5 ul of 2X ES Taq MasterMix (CWBIO, 
Beijing, China), 0.3 ul (10 pm/ul) of each DQB1 primer, 
roughly 20 ng template DNA, and distilled, deionized H 2 0 
to 10 ul. All PCR amplifications were carried out using a 
MyCycler™ Thermal Cycler (BIO-RAD, USA), with the 
first step at 94°C for 5 minutes, 30 cycles of 94°C for 30 sec- 
onds, 62°C for 25 seconds, 72°C for 45 seconds, and a final 
extension at 72°C for 10 minutes. Products were examined 
with 1% agarose gel electrophoresis and visualized under 
ultraviolet light. 

All positive PCR amplification products were screened 
for polymorphisms in DQB1 exon 2 by SSCP analysis. 
Two ul of the PCR sample was mixed with 8 ul of the 
denaturating buffer (95% formamide, 5% 6X loading buf- 
fer) before heating for 10 min at 98°C. The samples after 
denaturation were immediately cooled in ice water at least 
10 min, then loaded in a 2-mm-thick 12% acrylamide gel 
(no glycerol contained) and were electrophorized in 0.5X 
TBE at 130 V (15 h) at room temperature. The resulting 
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bands were visualized by silver staining. PCR products that 
showed the same SSCP pattern were counted and typed. 

Cloning and sequencing 

Representative samples from each population were ampli- 
fied with a reaction volume of 50 ul by PCR; the product 
was fractionated by electrophoresis in 18% agarose gels 
and purified using a Gel Extraction Kit (Sangon Biotech, 
Shanghai, China). Purified pieces of DNA were then 
cloned using the pMD19-T Vector kit (Takara, Dalian, 
China). Briefly, -2-3 ul of each positive purified DNA was 
added into each mixture containing 2.5 ul of Solution I 
and 0.5 ul of pMD19-T Vector. The mixtures were incu- 
bated at 16°C for 40 min. For transformation, we added 
25 ul of Escherichia coli DH5a-competent cells to the in- 
cubated ligation mixture, and then we put this on ice for 
30 minutes, heat-shocked the cells at 42°C for 52 seconds 
in a water bath, immediately put this on ice for at least 
3 minutes, and added 800 ul 37°C LB media. This mixture 
was then incubated at 37°C with shaking at approximately 
150 rpm for 80 min, and recovery cells were mixed with 
7 ul IPTG and 40 ul X-Gal, then 160 ul of the mixture was 
plated onto LB/ampicillin plates and incubated at 37°C 
overnight (14-16 hours). After bluewhite selection, a 
number of positive clones were picked and grown, and 
10-12 clones were chosen and sequenced by the sequen- 
cing company (Invitrogen, Shanghai, China). To avoid er- 
rors arising from PCR-based recombination, apart from 
the optimization of the PCR program, we have taken sev- 
eral measures to determine a real sequence: 1) a sequence 
must be detected in at least 2 animal individuals; and/or 
2) if a sequence was found in only one animal individual, 
it must be confirmed by at least 2 independent PCRs. 

Data analysis 

Mamu-DQB 1 diversity analyses 

The sequences obtained from this study were initially ad- 
justed using DNAStar software (DNASTAR Inc., Madison, 
WI). Multiple sequence alignments were created using the 
Clustal W program in DAMBE software [69]. Nucleotide 
diversity (n) and haplotype diversity (h) were calculated 
using DnaSP (Version 5) [70]. Allelic frequency distribu- 
tion, expected (He) and observed (Ho) heterozygosities, 
pairwise F ST and genetic flow (Nm) among populations 
were estimated with GenAlEx 6.5 [71]. Inbreeding coeffi- 
cient (F IS ) and allelic richness (A R ) per population were cal- 
culated with the FSTAT program, version 2.9.3 [72]. Allelic 
richness is a measure of the number of alleles independent 
of sample size. Hence, it can be compared between samples 
of different sizes. 

Selection pressure and phylogenetic analysis 

Non-synonymous (d N ) and synonymous (d s ) substitution 
rates at Mamu-DQBl amino acid sites were calculated 



according to the Nei-Gjobori method in Mega 5.05 [73]; 
the Z-test for selection was used to evaluate whether d N 
was significantly greater than d s for entire sequences. 
The ABS and non-ABS were distinguished based on 
HLA-DQB1 structure [74]. Phylogenetic trees were re- 
constructed for Mhc-DQBl allelic sequences using Mega 
5.05, based on the neighbor- joining method with Kimura 
2-para model. The reliability of the neighbor-joining tree 
was estimated by bootstrapping, in which 1000 bootstrap 
pseudosamples were used. To expand our data and com- 
pare with the closely related species, an additional 46 
Mamu-DQBl, 85 Mafa-DQBl (Macaca fascicularis), 
and 2 Aona-DQBl (Aotus nancymaae) alleles were also 
retrieved from the Immuno Polymorphism Database. 
Thus, a total of 156 Mhc-DQBl gene sequences includ- 
ing 23 sequences identified in this study were used in 
the phylogenetic analysis, and Aona-DQBl was used as 
an out-group. Information for all these sequences is 
shown in Additional file 1: Table SI. 

Mamu-DQBl genetic differentiation analyses 

The degree of population subdivision was estimated on 
the basis of hierarchical analysis of molecular variance 
(AMOVA) using the Arlequin version 3.1 package [75]. 
This package was used to compare the component of gen- 
etic diversity from the variance among the sampled major 
geographical regions and among subpopulations within 
each region. In order to measure the role of balancing se- 
lection in maintaining MHC variation among the popula- 
tion, the microsatellite data sets of Li et al. [25] that 
analyzed microsatellite variation at 30 loci in six popula- 
tions (including all populations in this study) were used, 
and correlations between Mhc-DQBl and microsatellite 
data were assessed using the Pearson product-moment 
correlation in SPSS 16.0. 

Availability of supporting data 

The data sets supporting the results of this article are avail- 
able in the Dryad repository, doi: 10.506 l/dryad.h8b93 [76]. 

Additional file 



Additional file 1: Table SI. Mhc-DQBl alleles of different primate species 
used in this study A total of 156 Mhc-DQBl alleles from three primate species 
{Macaca mulatto, Macaca fascicularis, Aotus nancymaae) were used in this 
study, including 69 Mamu-DQBl, 85 Mafa-DQBl and 2 Aona-DQBl alleles. 
Alleles detected in this study are shown in bold. IPD Acc No. is accession 
number in the IPD-MHC Database (http//www.ebi.ac.uk/ipd/mhc/). 
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